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Method And Apparatus For Efficient Generation Of Texture Coordinate 
Displacements For Implementing Emboss-Stvle Bum p Mapping In A 

Graphics Rendering Svstem 

Cross-Reference to Related Applications 

5 This application claims the benefit of U.S. Provisional Application serial No. 

60/226,892, filed August 23, 2000, the entire content of which is hereby 
incorporated by reference. 

This application is also related to the following commonly assigned co- 
pending applications identified below (by title and serial number or attorney docket 

10 number), which focus on various aspects of the graphics system described herein. 
Each of the following applications are incorporated herein by reference: 

• provisional Application No. 60/161 ,915, filed October 28. 1999 and its 
corresponding utility Application No. 09/465,754. filed December 17, 1999, 
both entitled "Vertex Cache For 3D Computer Graphics": 

15 • provisional Application No. 60/226,912, filed August 23. 2000 and its 

corresponding utility Application No. , filed (atty. dkt. no. 

723-959), both entitled "Method and Apparatus for Buffering Graphics Data in 
a Graphics System "; 

• provisional Application No. 60/226,889, filed August 23. 2000 and its 

20 corresponding utility Application No. , filed (atty. dkt. no. 

723-958), both entitled "Graphics Pipeline Token Synchronization"; 



provisional Application No. 60/226,891, filed August 23, 2000 and its 

corresponding utility Application No. , filed (atty. dkt. no. 

723-961), both entitled "Method And Apparatus For Direct and Indirect 
Texture Processing In A Graphics System"; 

provisional Application No. 60/226,888, filed August 23, 2000 and its 

corresponding utility Application No. , filed (atty. dkt. no. 

723-968), both entitled "Recirculating Shade Tree Blender For A Graphics 
System"; 

provisional Application No. 60/226,893, filed August 23, 2000 and its 

corresponding utility Application No. filed (atty. dkt. no. 

723-962), both entitled "Method And Apparatus For Environment-Mapped 
Bump-Mapping In A Graphics System"; 

provisional Application No. 60/227,007, filed August 23, 2000 and its 

corresponding utility Application No. , filed (atty. dkt. no. 

723-967), both entitled "Achromatic Lighting in a Graphics System and 
Method"; 

provisional Application No. 60/226,900, filed August 23, 2000 and its 

corresponding utility Application No. , filed (atty. dkt. no. 

723-964), both entitled "Method And Apparatus For Anti- Aliasing In A 
Graphics System"; 



provisional Application No. 60/226,910, filed August 23, 2000 and its 

corresponding utility Application No. , filed fatty, dkt. no. 

723-957), both entitled "Graphics System With Embedded Frame Buffer 
Having Reconfigurable Pixel Formats"; 

utility Application No. 09/585,329, filed June 2, 2000, entitled "Variable Bit 

Field Color Encoding" (atty. dkt. no. 723-749); 

provisional Application No. 60/226,890, filed August 23, 2000 and its 

corresponding utility Application No. , filed (atty. dkt. no. 

723-956), both entitled "Method And Apparatus For Dynamically 
Reconfiguring The Order Of Hidden Surface Processing Based On Rendering 
Mode"; 

provisional Application No. 60/226,915, filed August 23, 2000 and its 

corresponding utility Application No. filed (atty. dkt. no. 

723-973), both entitled "Method And Apparatus For Providing Non- 
Photorealistic Cartoon Outlining Within A Graphics System"; 
provisional Application No. 60/227,032, filed August 23, 2000 and its 

corresponding utility Application No. , filed , (atty. dkt. no. 

723-954), both entitled "Method And Apparatus For Providing Improved Fog 
Effects In A Graphics System"; 
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• provisional Application No. 60/226,885, filed August 23. 2000 and its 

corresponding utility Application No. , filed , (atty. dkt. no. 

723-969), both entitled "Controller Interface For A Graphics System"; 

• provisional Application No. 60/227.033. filed August 23. 2000 and its 

5 corresponding utility Application No. , filed (atty. dkt. no. 

723-955), both entitled "Method And Apparatus For Texture Tiling In A 
Graphics System"; 

• provisional Application No. 60/226,899, filed August 23, 2000 and its 
corresponding utility Application No. , filed (atty. dkt. no. 

10 723-97 1), both entitled "Method And Apparatus For Pre-Caching Data In 
Audio Memory"; 

• provisional Application No. 60/226,913, filed August 23, 2000 and its 

corresponding utility Application No. , filed (atty. dkt. no. 

723-965), both entitled "Z-Texturing"; 

15 • provisional Application No. 60/227,03 1 , filed August 23, 2000 entitled 

"Application Program Interface for a Graphics System" (atty. dkt. no. 723-880), 

• provisional Application No. 60/227,030, filed August 23, 2000 and its 

corresponding utility Application No. , filed (atty. dkt. no. 

723-963), both entitled "Graphics System With Copy Out Conversions Between 

20 Embedded Frame Buffer And Main Memory"; 



provisional Application No. 60/226,886, filed August 23, 2000 and its 

corresponding utility Application No. , filed (atty. dkt. no. 

723-970), both entitled "Method and Apparatus for Accessing Shared 
Resources"; 

provisional Application No. 60/226,884. filed August 23, 2000 and its 

corresponding utility Application No. , filed (atty. dkt. no. 

723-972), both entitled "External Interfaces For A 3D Graphics and Audio 
Coprocessor"; 

provisional Application No. 60/226,894, filed August 23, 2000 and its 

corresponding utility Application No. , filed (atty. dkt. no. 

723-974), both entitled "Graphics Processing System With Enhanced Memory 
Controller"; 

provisional Application No. 60/226,914, filed August 23, 2000 and its 

corresponding utility Application No. , filed , (atty. dkt. no. 

723-966), both entitled " Low Cost Graphics System With Stitching Hardware 
Support For Skeletal Animation", and 

provisional Application No. 60/227,006. filed August 23, 2000 and its 

corresponding utility Application No. , filed (atty. dkt. no. 

723-953), both entitled " Shadow Mapping In A Low Cost Graphics System". 
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Field of the Invention 

The present invention relates to computer graphics, and more particularly to 
interactive graphics systems such as home video game platforms. Still more 
particularly this invention relates to efficient generation of texture coordinate 
5 displacements for implementing emboss-style bump-mapping effects for diffuse-lit 
textures on a rendered object. 

Background And Summary Of The Invention 

Many of us have seen films containing remarkably realistic dinosaurs, 
aliens, animated toys and other fanciful creatures. Such animations are made 

10 possible by computer graphics. Using such techniques, a computer graphics artist 
can specify how each object should look and how it should change in appearance 
over time, and a computer then models the objects and displays them on a display 
such as your television or a computer screen. The computer takes care of 
performing the many tasks required to make sure that each part of the displayed 

15 image is colored and shaped just right based on the position and orientation of each 
object in a scene, the direction in which light seems to strike each object, the 
surface texture of each object, and other factors. 

Because computer graphics generation is complex, computer-generated 
three-dimensional graphics just a few years ago were mostly limited to expensive 

20 specialized flight simulators, high-end graphics workstations and supercomputers. 
The public saw some of the images generated by these computer systems in movies 
and expensive television advertisements, but most of us couldn't actually interact 
with the computers doing the graphics generation. All this has changed with the 
availability of relatively inexpensive 3D graphics platforms such as. for example, 

25 the Nintendo 64® and various 3D graphics cards now available for personal 
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computers. It is now possible to interact with exciting 3D animations and 
simulations on relatively inexpensive computer graphics systems in your home or 
office. 

One problem that graphics system designers have often confronted in the 

5 past was the efficient rendering of a 3D object that displays realistic-looking 

surface characteristics that react to various lighting conditions in a manner similar 
to the surface of an actual object having, for example, random surface flaws, 
irregularities, roughness, bumps or other slight non-planar surface variations. 
While in some instances such minute surface characteristics might be actually 

10 modeled, the time required for translating and rendering a 3D object with such a 
complex surface would be prohibitive for most real-time or interactive gaming 
applications. Consequently, various solutions to this problem were offered. For 
example, a technique generally known as "bump-mapping" was developed which 
allowed one to approximate the effect that non-planar surface variations would 

15 produce on lighted object. See, for example, J. F. Blinn "Simulation of Wrinkled 
Surfaces" Computer Graphics, (SIGRAPH '78 Proceedings), vol. 12, No. 3, pp. 
286-292 (August 1978); "Models of Light Reflection for Computer Synthesized 
Pictures", Proc. 4 th Conference on Computer Graphics and Instructive Techniques, 
1977; and "Programming with OpenGL: Advanced Rendering" by Tom 

20 McReynolds and David Blythe - SIGGRAPH "97 course - Section 8.3 "Bump 
Mapping with Textures". Basically, this technique allows a graphics application 
programmer to add realism to an image without using a lot of geometry by 
modeling small surface variations as height differences and then applying those 
difference values over a surface as perturbations to a surface Normal vector used in 

25 computing surface lighting effects. Effectively, a bump-map modifies the shading 
of a polygon by perturbing the surface Normal on a per-pixel basis. The shading 
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makes the surface appear bumpy, even though the underlying geometry is 
relatively flat. 

Most conventional approaches toward implementing simple forms of bump- 
mapping effects with diffuse-lit textured surfaces generally entail computing, for 

5 each pixel the difference between a first sample of a bump map texture image at a 
particular texture coordinate and a second sample of the same texture image at a 
texture coordinate displacement. In addition, computing a texture coordinate 
displacement map generally involves computations using eye-space components of 
surface Tangent and Binormal vectors (binomials). In particular, to implement a 

10 simple form of bump-mapping having an embossing type effect on a texture image, 
it is most efficient to compute and apply the texture coordinate displacements in 
the eye-space (view-space/camera-space) reference frame — which is more 
conducive to a subsequent rasterizing process prior rendering for display. 
Consequently, texture coordinate displacement for emboss-style bump-mapping is 

15 preferably computed and generated after vertex position and surface binomials at a 
vertex are transformed from model-space into eye-space for pixel rendering. 

Typically, in low cost graphics processing systems such as a home video 
game system, vertex transformation and lighting (T&L) operations are commonly 
performed by the application program using the graphics system host CPU — 

20 primarily because a software T&L implementation, although more computationally 
taxing on the host CPU, is usually less expensive than using specialized hardware. 
Hardware implementation of T&L, however, may be preferable in gaming systems 
because it typically results in much faster renderings and can free up host CPU 
processing time for performing other desirable tasks such as game strategy and AI 

25 computations for improved game performance. Moreover, in graphics rendering 
arrangements where T&L operations are performed by the application software on 
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the host CPU, additional processing tasks such as performing texture coordinate 
computations for bump-mapping can significantly add to the processing overhead. 

In graphics rendering systems where the T&L operations are performed by 
dedicated graphics hardware, the host CPU typically provides model-space vertex 
5 attributes to the dedicated T&L hardware and then allows the hardware to perform 
all the coordinate space transformations and lighting computations. Consequently, 
it is not particularly efficient to require the host CPU to compute texture coordinate 
displacements for bump mapping purposes subsequent to the T&L hardware 
performing space transformations of the vertex position and surface 
10 normal/binormal vectors. Essentially, this would effectively undermine rendering 
speed improvements gained from utilizing dedicated T&L hardware whenever 
bump mapping operations are performed. 

The present invention solves this problem by providing techniques and 
arrangements in a graphics rendering system for the efficient generation of texture 

15 coordinate displacements for implementing at least an emboss- style bump- 
mapping texture effect without the need for the host CPU application software to 
compute the required texture coordinate displacements. An enhanced API 
(applications program interface) vertex attribute function capable of specifying 
three surface normals per vertex (i.e., the Normal, Tangent and Binomial) is 

20 utilized and the host CPU application software need only compute the required 
additional Tangent and Binormal surface vectors per vertex in object-space 
(model-space), in addition to providing the surface Normal and other conventional 
per- vertex attributes. 

Some of the features provided by aspects of this invention include: 

25 • use of a texture-combining unit capable of performing texture subtraction 

in one pass, 
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• use of texture combining for bump mapping that performs texture 
combining in texture hardware, 

• scaling of the binormals (Tangent and Binormal) by scaling a model view 
matrix and applying the model view matrix to the binormals, 

5 • computation of texture displacements using the Binormal and Tangent 

vectors but not the Normal input vector, 

• increased performance through use of two distinct dot product 
computation units (one dot unit performs model view matrix multiply, 
the second computes in parallel the dot products of the Tangent and 

10 Binormal with the light direction vector as well as the square of the light 

direction vector), 

• fully pipelined hardware can perform the necessary computations using a 
small number of distinct operations. 

In accordance with one aspect of the present invention, a graphics rendering 
15 system is provided with enhanced vertex transformation and lighting (T&L) 

hardware that is capable of performing at least simple emboss-style bump-mapping 
in addition to the conventional T&L operations. This style of bump-mapping is 
useful when the surface geometry of an object is being animated. The vector 
geometry processing portion of the T&L hardware is enhanced to accommodate 
20 processing a transformation of object-space vertex surface binormals (i.e., the 
Tangent and Binormal vectors) to eye-space and the computation of a texture 
coordinate displacement based on light direction (light-to-vertex) vector dot 
products with the transformed binormals. 

In accordance with another aspect of the present invention, an enhanced 
25 vertex attribute description API function provides three vertex surface normals (N, 
B and T) to the T&L vector geometry processing hardware along with vertex 
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position and light source position. The geometry processing hardware then 
transforms the surface normals to eye-space, computes the light vector in eye- 
space and uses the vector components to compute the appropriate texture 
coordinate displacements for use in producing an emboss-style bump mapped 
5 texture effect. 

Brief Description Of The Drawings 

These and other features and advantages provided by the invention will be 
better and more completely understood by referring to the following detailed 
description of presently preferred embodiments in conjunction with the drawings, 
10 of which: 

Figure 1 is an overall view of an example interactive computer graphics 
system; 

Figure 2 is a block diagram of the Figure 1 example computer graphics 
system; 

15 Figure 3 is a block diagram of the example graphics and audio processor 

shown in Figure 2; 

Figure 4 is a block diagram of the example 3D graphics processor shown in 
Figure 3; 

Figure 5 is an example logical flow diagram of the Figure 4 graphics and 
20 audio processor; 

Figure 6 is a flow chart illustrating example steps for implementing emboss- 
style bump mapping; 
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Figure 7 is an example logic flow diagram of vector processing and bump 
mapping hardware provided in the Transform unit for implementing emboss-style 
bump mapping; 

Figure 8 is a block diagram showing a detailed example of the dot-product 
5 computation units and light direction computation hardware provided in the 
Transform unit for implementing emboss-style bump mapping; and 

Figures 9 and 10 show example alternative compatible implementations. 

Detailed Description Of Example Embodiments Of The Invention 

Figure 1 shows an example interactive 3D computer graphics system 50. 
10 System 50 can be used to play interactive 3D video games with interesting stereo 
sound. It can also be used for a variety of other applications. 

In this example, system 50 is capable of processing, interactively in real 
time, a digital representation or model of a three-dimensional world. System 50 
can display some or all of the world from any arbitrary viewpoint. For example. 

15 system 50 can interactively change the viewpoint in response to real time inputs 
from handheld controllers 52a, 52b or other input devices. This allows the game 
player to see the world through the eyes of someone within or outside of the 
world. System 50 can be used for applications that do not require real time 3D 
interactive display (e.g., 2D display generation and/or non-interactive display), but 

20 the capability of displaying quality 3D images very quickly can be used to create 
very realistic and exciting game play or other graphical interactions. 

To play a video game or other application using system 50, the user first 
connects a main unit 54 to his or her color television set 56 or other display device 
by connecting a cable 58 between the two. Main unit 54 produces both video 
25 signals and audio signals for controlling color television set 56. The video signals 
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are what controls the images displayed on the television screen 59, and the audio 
signals are played back as sound through television stereo loudspeakers 61L. 61R. 

The user also needs to connect main unit 54 to a power source. This power 
source may be a conventional AC adapter (not shown) that plugs into a standard 
5 home electrical wall socket and converts the house current into a lower DC voltage 
signal suitable for powering the main unit 54. Batteries could be used in other 
implementations. 

The user may use hand controllers 52a, 52b to control main unit 54. 
Controls 60 can be used, for example, to specify the direction (up or down, left or 

10 right, closer or further away) that a character displayed on television 56 should 
move within a 3D world. Controls 60 also provide input for other applications 
(e.g., menu selection, pointer/cursor control etc.). Controllers 52 can take a 
variety of forms. In this example, controllers 52 shown each include controls 60 
such as joysticks, push buttons and/or directional switches. Controllers 52 may be 

15 connected to main unit 54 by cables or wirelessly via electromagnetic (e.g.. radio 
or infrared) waves. 

To play an application such as a game, the user selects an appropriate 
storage medium 62 storing the video game or other application he or she wants to 
play, and inserts that storage medium into a slot 64 in main unit 54. Storage 

20 medium 62 may, for example, be a specially encoded and/or encrypted optical 
and/or magnetic disk. The user may operate a power switch 66 to turn on main 
unit 54 and cause the main unit to begin running the video game or other 
application based on the software stored in the storage medium 62. The user may 
operate controllers 52 to provide inputs to main unit 54. For example, operating a 

25 control 60 may cause the game or other application to start. Moving other controls 
60 can cause animated characters to move in different directions or change the 
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user's point of view in a 3D world. Depending upon the particular software stored 
within the storage medium 62, the various controls 60 on the controller 52 can 
perform different functions at different times. 

Example Electronics of Overall System 

5 Figure 2 shows a block diagram of example components of system 50. The 

primary components include: 

• a main processor (CPU) 110, 

• a main memory 1 1 2, and 

• a graphics and audio processor 1 14. 

10 In this example, main processor 1 10 (e.g., an enhanced IBM Power PC 750) 

receives inputs from handheld controllers 108 (and/or other input devices) via 
graphics and audio processor 1 14. Main processor 1 10 interactively responds to 
user inputs, and executes a video game or other program supplied, for example, by 
external storage media 62 via a mass storage access device 106 such as an optical 

15 disk drive. As one example, in the context of video game play, main processor 110 
can perform collision detection and animation processing in addition to a variety of 
interactive and control functions. 

In this example, main processor 110 generates 3D graphics and audio 
commands and sends them to graphics and audio processor 114. The graphics and 
20 audio processor 114 processes these commands to generate interesting visual 

images on display 59 and interesting stereo sound on stereo loudspeakers 61R, 61L 
or other suitable sound-generating devices. 

Example system 50 includes a video encoder 120 that receives image signals 
from graphics and audio processor 1 14 and converts the image signals into analog 
25 and/or digital video signals suitable for display on a standard display device such 
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as a computer monitor or home color television set 56. System 50 also includes an 
audio codec (compressor/decompressor) 122 that compresses and decompresses 
digitized audio signals and may also convert between digital and analog audio 
signaling formats as needed. Audio codec 122 can receive audio inputs via a 
5 buffer 124 and provide them to graphics and audio processor 1 14 for processing 
(e.g., mixing with other audio signals the processor generates and/or receives via a 
streaming audio output of mass storage access device 106). Graphics and audio 
processor 1 14 in this example can store audio related information in an audio 
memory 126 that is available for audio tasks. Graphics and audio processor 1 14 
10 provides the resulting audio output signals to audio codec 122 for decompression 
and conversion to analog signals (e.g., via buffer amplifiers 128L, 128R) so they 
can be reproduced by loudspeakers 61L, 61R. 

Graphics and audio processor 114 has the ability to communicate with 
various additional devices that may be present within system 50. For example, a 
15 parallel digital bus 130 may be used to communicate with mass storage access 
device 106 and/or other components. A serial peripheral bus 132 may 
communicate with a variety of peripheral or other devices including, for example: 

• a programmable read-only memory and/or real time clock 134, 

• a modem 136 or other networking interface (which may in turn connect 
20 system 50 to a telecommunications network 138 such as the Internet or 

other digital network from/to which program instructions and/or data can 
be downloaded or uploaded), and 

• flash memory 140. 

A further external serial bus 142 may be used to communicate with 
25 additional expansion memory 144 (e.g., a memory card) or other devices. 
Connectors may be used to connect various devices to busses 130, 132, 142. 
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Example Graphics And Audio Processor 

Figure 3 is a block diagram of an example graphics and audio processor 114. 
Graphics and audio processor 1 14 in one example may be a single-chip ASIC 
(application specific integrated circuit). In this example, graphics and audio 
5 processor 1 14 includes: 

• a processor interface 150, 

• a memory interface/controller 152, 

• a 3D graphics processor 154, 

• an audio digital signal processor (DSP) 156, 
10 * an audio memory interface 158. 

• an audio interface and mixer 160, 

• a peripheral controller 162, and 

• a display controller 164. 

3D graphics processor 154 performs graphics processing tasks. Audio 
15 digital signal processor 156 performs audio processing tasks. Display controller 
164 accesses image information from main memory 112 and provides it to video 
encoder 120 for display on display device 56. Audio interface and mixer 160 
interfaces with audio codec 122, and can also mix audio from different sources 
(e.g., streaming audio from mass storage access device 106, the output of audio 
20 DSP 156, and external audio input received via audio codec 122). Processor 
interface 150 provides a data and control interface between main processor 1 10 
and graphics and audio processor 1 14. 

Memory interface 152 provides a data and control interface between 
graphics and audio processor 114 and memory 112. In this example, main 
25 processor 1 10 accesses main memory 1 12 via processor interface 150 and memory 
interface 152 that are part of graphics and audio processor 114. Peripheral 
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controller 162 provides a data and control interface between graphics and audio 
processor 1 14 and the various peripherals mentioned above. Audio memory 
interface 158 provides an interface with audio memory 126. 

Example Graphics Pipeline 

5 Figure 4 shows a more detailed view of an example 3D graphics processor 

154. 3D graphics processor 154 includes, among other things, a command 
processor 200 and a 3D graphics pipeline 180. Main processor 110 communicates 
streams of data (e.g., graphics command streams and display lists) to command 
processor 200. Main processor 1 10 has a two-level cache 115 to minimize 
10 memory latency, and also has a write -gathering buffer 111 for un-cached data 
streams targeted for the graphics and audio processor 1 14. The write-gathering 
buffer 111 collects partial cache lines into full cache lines and sends the data out to 
the graphics and audio processor 114 one cache line at a time for maximum bus 
usage. 

15 Command processor 200 receives display commands from main processor 

110 and parses them — obtaining any additional data necessary to process them 
from shared memory 112. The command processor 200 provides a stream of 
vertex commands to graphics pipeline 180 for 2D and/or 3D processing and 
rendering. Graphics pipeline 180 generates images based on these commands. 

20 The resulting image information may be transferred to main memory 1 12 for 
access by display controller/video interface unit 164 — which displays the frame 
buffer output of pipeline 180 on display 56. 

Figure 5 is a logical flow diagram of graphics processor 154. Main 
processor 110 may store graphics command streams 210. display lists 212 and 
25 vertex arrays 214 in main memory 1 12, and pass pointers to command processor 



18 



200 via bus interface 150. The main processor 1 10 stores graphics commands in 
one or more graphics first-in-first-out (FIFO) buffers 210 it allocates in main 
memory 110. The command processor 200 fetches: 

• command streams from main memory 1 12 via an on-chip FIFO memory 
5 buffer 216 that receives and buffers the graphics commands for 

synchronization/flow control and load balancing, 

• display lists 212 from main memory 1 12 via an on-chip call FIFO 
memory buffer 218, and 

• vertex attributes from the command stream and/or from vertex arrays 214 
10 in main memory 1 12 via a vertex cache 220. 

Command processor 200 performs command processing operations 200a 
that convert attribute types to floating point format, and pass the resulting complete 
vertex polygon data to graphics pipeline 180 for rendering/rasterization. A 
programmable memory arbitration circuitry 130 (see Figure 4) arbitrates access to 
15 shared main memory 112 between graphics pipeline 180, command processor 200 
and display controller/ video interface unit 164. 

Figure 4 shows that graphics pipeline 180 may include: 

• a transform unit 300, 

• a setup/rasterizer 400, 
20 • a texture unit 500, 

• a texture environment unit 600, and 

• a pixel engine 700. 

Transform unit 300 performs a variety of 2D and 3D transform and other 
operations 300a (see Figure 5). Transform unit 300 may include one or more 
25 matrix memories 300b for storing matrices used in transformation processing 300a. 
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Transform unit 300 transforms incoming geometry per vertex from object space to 
screen space; and transforms incoming texture coordinates and computes 
projective texture coordinates (300c). Transform unit 300 may also perform 
polygon clipping/culling (300d). Lighting processing 300e also performed by 
5 transform unit 300b provides per vertex lighting computations for up to eight 
independent lights in one example embodiment. As discussed herein in greater 
detail, Transform unit 300 also performs texture coordinate generation (300c) for 
emboss-style bump mapping effects. 

Setup/rasterizer 400 includes a setup unit which receives vertex data from 
10 transform unit 300 and sends triangle setup information to one or more rasterizer 
units (400b) performing edge rasterization, texture coordinate rasterization and 
color rasterization. 

Texture unit 500 (which may include an on-chip texture memory (TMEM) 
502) performs various tasks related to texturing including for example: 

15 • retrieving textures 504 from main memory 112, 

• texture processing (500a) including, for example, multi-texture handling, 
post-cache texture decompression, texture filtering, embossing, shadows 
and lighting through the use of projective textures, and BLIT with alpha 
transparency and depth, 

20 • bump map processing for computing texture coordinate displacements for 

bump mapping, pseudo texture and texture tiling effects (500b), and 

• indirect texture processing (500c). 

Texture unit 500 performs texture processing using both regular (non- 
indirect) and indirect texture lookup operations. A more detailed description of the 
25 example graphics pipeline circuitry and procedures for performing regular and 
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indirect texture look-up operations is disclosed in commonly assigned co-pending 

patent application, Ser. No. , entitled "Method And Apparatus For 

Direct And Indirect Texture Processing In A Graphics System" (attorney docket 
no. 723-961) and its corresponding provisional application, serial no. 60/226,891, 
5 filed August 23, 2000, both of which are incorporated herein by reference. 

Texture unit 500 outputs filtered texture values to the Texture Environment 
Unit 600 for texture environment processing (600a). Texture environment unit 600 
blends polygon and texture color/alpha/depth, and can also perform texture fog 
processing (600b) to achieve inverse range based fog effects. Texture environment 

10 unit 600 can provide multiple stages to perform a variety of other interesting 
environment-related functions based for example on color/alpha modulation, 
embossing, detail texturing, texture swapping, clamping, and depth blending. 
Texture environment unit 600 can also combine (e.g., subtract) textures in 
hardware in one pass. For more details concerning the texture environment unit 

15 600, see commonly assigned application serial no. entitled 

"Recirculating Shade Tree Blender for a Graphics System" (attorney docket no. 
723-968) and its corresponding provisional application, serial no. 60/226,888, filed 
August 23, 2000, both of which are incorporated herein by reference application. 

Pixel engine 700 performs depth (z) compare (700a) and pixel blending 
20 (700b). In this example, pixel engine 700 stores data into an embedded (on-chip) 
frame buffer memory 702. Graphics pipeline 180 may include one or more 
embedded DRAM memories 702 to store frame buffer and/or texture information 
locally. Z compares 700a r can also be performed at an earlier stage in the graphics 
pipeline 180 depending on the rendering mode currently in effect (e.g., z compares 
25 can be performed earlier if alpha blending is not required). The pixel engine 700 
includes a copy operation 700c that periodically writes on-chip frame buffer 702 to 
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main memory 1 12 for access by display/video interface unit 164. This copy 
operation 700c can also be used to copy embedded frame buffer 702 contents to 
textures in the main memory 1 12 for dynamic texture synthesis effects. Anti- 
aliasing and other filtering can be performed during the copy-out operation. The 
5 frame buffer output of graphics pipeline 180 (which is ultimately stored in main 
memory 1 12) is read each frame by display/video interface unit 164. Display 
controller/video interface 164 provides digital RGB pixel values for display on 
display 102. 

Example Emboss-Style Bump Mapping Texture Coordinate Generation 

10 Figure 6 is a flowchart showing an example set of basic processing steps 

used to perform emboss bump-mapping in the system described above. In the 
example embodiment, most of the Figure 6 steps are performed by Transform Unit 
300 based on per- vertex Tangent and Binormal vector data supplied by Command 
Processor 200. Command Processor 200 may obtain such per- vertex values from 

15 main processor 110 and/or from main memory 112. 

Briefly, the graphics pipeline renders and prepares images for display at 
least in part in response to polygon vertex attribute data and texel color data stored 
as a texture image in an associated memory. The graphics rendering pipeline is 
provided with vertex transformation and lighting (T&L) hardware that is capable 

20 of performing simple bump-mapping operations in addition to the more 

conventional T&L operations. Pipelined hardware efficiently generates texture 
coordinate displacements for implementing emboss-style bump-mapping effects 
utilizing object-space (model-space) surface normals supplied per vertex, for 
example, by a graphics application running on the main CPU of the graphics 

25 system. An enhanced vertex attribute description command function facilitates the 
communication and processing of plural surface normals per- vertex in addition to 
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other vertex attributes such as vertex position, light source position and texture 
coordinates. The enhanced vertex attribute function specifies Normal Tangent and 
Bmormal surface vectors (N, T & B) provided by the host CPU in object space 
coordinates and uses separate memory indexes per vertex for each of the three 
5 surface vectors so as to effectively compress the amount of data needed for bump 
mapping. A vector geometry processing portion of the T&L hardware is also 
enhanced by providing two distinct dot-product computation units to transform the 
Tangent and Binormal surface vectors to eye-space using a scaled model view 
matrix, compute a light direction vector in eye-space and perform parallel dot- 
10 product computations between the computed light direction vector and the 

transformed Tangent and Binormal vectors to efficiently generate the appropriate 
texture coordinate displacements for use in creating an embossed texture effect. 

In one example embodiment, system 50 first stores a texture image in texture 
memory 502 (see Figure 4) for use with the bump mapping operation (block 800). 

15 Command Processor 200 then provides object-space basis Tangent and Binormal 
vector data to transform Transform Unit 300 using vertex attribute functions 
defined in an appropriate graphics API (block 802). The Transform Unit 300 
transforms the Tangent and Binormal vector data to eye space (block 804). 
Transform Unit 300 also computes a light direction (light-to- vertex) vector and a 

20 normalized light direction vector (block 806). Transform Unit 300 then computes 
texture coordinate displacements and new texture coordinate values per vertex 
(blocks 808, 810). Texture Environment (TEV) unit 600 develops an embossed 
texture from the original texture stored in texture memory 502 minus the offset 
texture defined by the displacements (block 812). In other words, the original 

25 texture is looked-up using both non-displaced coordinates (s, t) and displaced 
coordinates (s + As, t + A t) and the texture values are subtracted per-pixel. The 
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result is combined with per- vertex local diffuse lighting in graphics pipeline 180 
and the resulting embossed image is rendered for display on display 56 (block 
814). The embossed image results may also be combined with other textures. 

In more detail, bump mapping described above generates at least: (1) texture 
5 coordinate displacements (As, At) based on incoming texture coordinates (block 
808), (2) a normalized light direction (block 806) and (3) a per- vertex coordinate 
basis function (block 802). The preferred basis function used is an orthogonal 
object-space coordinate basis. The three orthogonal axes of this coordinate basis 
are defined by the surface Normal vector, a surface "Tangent" vector and a second 

10 mutually perpendicular surface tangent "Binomial" vector with the Tangent (T) 
and Binormal (B) vectors oriented in directions corresponding to the texture 
gradient in s and the texture gradient in t (i.e., increasing s or t). The two 
orthogonal surface tangent vectors, T and B, are also called "binormals". Block 
802 provides these values. An object-space coordinate light vector projected onto 

15 this coordinate basis (block 806) is then used to compute texture coordinate 

displacements for bump-mapping. More specifically, the projection of the light 
direction vector onto each of the two binomials, T and B, gives the amount of 
texture space displacement the light causes. Basically, the light on the texture (i. 
e., the light direction vector) is decomposed into its surface normal component and 

20 its (s, t) coordinate components corresponding to the respective texture gradients. 
These (s, t) coordinate components of the light direction vector are the (As, At) 
texture coordinate displacements (block 808) used for bump mapping. 

To perform the above operations properly for efficient rendering, object 
oriented Tangent and Binormal vectors at each vertex, which map in object space 
25 to the texture s and t axis, are preferably first converted to eye-space. 

Consequently, in the example implementation of the present invention, Command 
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Processor 200 supplies these two binormals per-vertex to Transform Unit 300 
(block 804). The Transform Unit will then transform the binormals to eye-space 
(block 804). (For the present example embodiment, even where the supplied 
binomials are constant, for example, with flat surfaces, Command Processor 200 
5 supplies the binormals to Transform Unit 300 on a per-vertex basis.) 

Mathematically, the following operations are performed by Transform Unit 300 
are: 



Tx 

Ty = Module Vie\vNormalMatrix(3x3) 
Tz 
'Bx 

By = ModelViewNormalMatrix(3x3) 
Bz 



Txv 
Tyv 
Tzv 
Bxv' 
Byv 
Bzv 



where Tv = (Txv, Tyv, Tzv) and Bv = (Bxv, Byv, Bzv) are the per-vertex 
10 binormals supplied to Transform Unit 300 by Command Processor 200. The Tv 
vector should preferably be normalized and aligned with the s texture axis in 
object-space and the Bv vector should preferably be normalized and aligned with 
the t texture axis in object space. The Model View transformation matrix should be 
purely rotational, which would maintain the unit length of the binormals. However, 
15 if scaling of the binormals is required, then the Model View transformation matrix 
can be multiplied by a scalar. The scale applied would then be the new unit length 
of the binormals. This could be used to visually increase the bump mapping effect 
without changing the source data or the algorithm. 

Given the binormal basis system, the light rotation matrix used by 
20 Transform Unit 300 (block 806) is as follows: 
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Tx Ty Tz 
Bx By Bz 



where (Tx, Ty, Tz) is the transformed binormal oriented along the s axis, in 
the direction of increasing s, while (Bx, By, Bz) is the transformed binormal 
oriented along the t axis, in the direction of increasing t. 

The light vector is computed (block 806) by normalizing the difference 
between the light position (in eye-space) and the current, transformed, vertex in 
eye space as follows: 







Vx - Lpx 


Lx 
Ly 
Lz 




Vy - Lpy 
Vz - Lpz 




\Vx - Lpz 
Vy - Lpy 
Vz - Lpz 



The texture coordinate displacement (As, At) is then computed per- vertex 
(block 808) as follows: 



"As" 




'Tx Ty Tz~ 






Bx By Bz 



Lx 
Ly 
Lz 
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Note that this preferred example algorithm does not use the Normal input vector to 
compute displacements. Only the Binomial and Tangent vectors are required. 
Other implementations specify a 3x3 matrix multiply including the eye-space 
Normal as an extra row. 

5 The computed per-vertex delta offsets, (As. At), are then added to the post- 

transform (i.e., after transform to eye-space) texture coordinate generated per- 
vertex (block 810) to obtain new texture coordinates SI and Tl: 



"SI" 




SO + As 




"so" 


+ 










_T0_ 





Tx 7v Tz 1 
Bx Bv Bzj 



Lx 
Ly 
Lz 



Example Emboss Bump-Mapping Texture Coordinate Generation 
10 Hardware Implementation 

To efficiently implement the above computation for emboss-style bump- 
mapping, Transform Unit 300 includes hardwired computational logic circuitry to 
perform at least the following emboss bump-mapping related vector and coordinate 
computations: 

15 • Compute T eye = MV»T 

Compute B eye = MV«B 
Compute L = V eye - L pos 
Compute L 2 
Compute T eye # L 
20 • Compute Beye^L 

Compute 1/| |L| |= 1/sqrt (L 2 ) 
Compute As = T«L/| |L| | 
Compute At = B-L/ 1 1 L j | 
Compute (SI, Tl) = (SO + As, TO + At) 
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where T and B are the respective object-space Tangent and Binomial 
vectors; MV is a transformation matrix having element values for converting 
vectors to eye-space; is the light position vector; V eye is the vertex position 
vector; L is the light-to- vertex vector; ( | L | ] is the normalized light direction 
5 vector; (SO, TO) are the regular transformed texture coordinates, (As, At), are the 
generated texture coordinate displacement values; and (SI, Tl) are the new texture 
coordinates from which an "offset" texture used in emboss bump-mapping is 
obtained. 

Figure 7 is a block diagram illustrating the logical flow of functional 

10 operations performed by hardware provided in transform unit 300 for efficiently 
generating the texture coordinate displacements, (As, At), needed for implementing 
emboss-style bump mapping. The graphics application running on main processor 
1 10 computes and supplies object-space binormals T and B to transform unit 300 
via command processor 200. An enhanced vertex attribute description function of 

15 the API (application programming interface) is used which allows specifying 
Normal, Tangent and Binormal vector data via separate memory indexes 
per- vertex. Command processor 200 provides this vector data to transform unit 
300. In addition to permitting at least three surface normals, the enhanced vertex 
attribute function allows the programmer to use separate per-vertex memory 

20 indexes for each of the three surface normals so as to effectively compress the 

amount of data needed to be explicitly specified for bump-mapping. As mentioned 
above, the supplied Tangent and Binormal vectors must map, at each vertex, to the 
texture s and t axis, in object space. Vector dot-product multiplication circuitry in 
Transform Unit 300 will then transform these vectors to eye-space, as illustrated by 

25 Figures 7 and 8. 
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Referring again to Figure 7, block 300f outlines specific vector dot-product 
processing hardware which may also be employed by Transform Unit 300 in 
performing computations other than that related to emboss-style bump-mapping. 
While block 300g outlines Transform Unit hardware more specifically useful in 

5 emboss bump-mapping computations, block 300g hardware may also be useful in 
performing other functions. For emboss-style bump-mapping, a first dot-product 
computation unit 301, computes the eye-space transformation of the Tangent and 
Binormal vectors. The transformed results are temporarily stored in 
multiplexing/staging buffer 302. A light-to-vertex vector computation 304 is 

10 performed on vertex position vector data, V eye? and light position vector data L pos , 
to provide light direction vector data, L. A second dot-product computation unit 
303 is utilized to compute, in parallel, the following: 

• vector dot-products between light direction vector L and each Binormal 
vector T and B; and 
15 • an L 2 vector product from light direction vector L. 

The L 2 vector product is subsequently provided to inverse square-root 
computation unit 305 for computing an inverse magnitude value of the light 
direction vector. The Binormal and Tangent vector lighting dot-products T»L and 
B«L from dot unit 303 are provided to floating multiplier 306 along with the 
20 computed inverse magnitude value of the light direction vector from unit 305. 
Floating point multiplier 306 then computes the texture coordinate displacements 
AS and AT which are passed to floating point adder 308. Transformed texture 
coordinates SO and TO are provided per vertex to delay FIFO 307 and are passed in 
a timely fashion to floating point adder 308 for combination with computed 
25 coordinate displacements AS and AT. The new texture coordinates generated, SI 
and Tl, are then passed to a vertex buffer unit (not shown) within transform unit 
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300 and subsequently passed via graphics pipeline 180 to texture unit 500 for 
texture lookup. In the preferred embodiment, the texture combining unit used is 
capable of performing texture subtraction in one pass instead of multiple passes. 
The preferred texture combining operation does not use an accumulation buffer. 

5 but instead does texture combining in texture hardware. 

Figure 8 shows a more detailed diagram of dot-product computation units 

301 and 303 and light direction computation hardware within transform unit 300 
for performing the emboss bump mapping functions of Figure 7. A preferred 
embodiment utilizes digital logic hardware capable of processing at least twenty- 

10 bit floating point numerical values. As illustrated in Figure 8. vector dot unit 301 
includes floating point multipliers 310, 31 1 and 312 coupled to floating adders 313 
and 314. The two surface binomials B and T are provided to floating point 
multipliers 310, 311 and 312 from vertex cash RAM 220 (Figure 5). Input FIFO 
315 receives transformation matrix data for converting the Tangent and Binormal 

15 vectors to eye-space from matrix memory 300b and provides the matrix element 
values to floating point multipliers 310, 31 1 and 312. Floating point adder 304 
performs the light-to-vertex vector computation for determining light direction 
vector L from transformed eye-space vertex position data and input light 
position/direction vector data. 

20 Vector dot unit 303 includes floating multipliers 3 1 7. 3 1 8 and 3 1 9 and 

floating point adders 320 and 321 for computing vector dot products of the light 
direction vector and the Tangent and Binormal eye space vector components. Dot 
unit 303 may also include multiplexor 302 for receiving and staging light direction 
vector and transformed eye-space Tangent and Binormal vector data from floating 

25 point adder 304 and dot unit 301 . Floating point multipliers 317 through 3 19 are 
used in combination with floating point adders 320 and 321 to provide a light 
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direction vector squared product, L 2 , a Tangent lighting vector dot-product (T»L) 
and a Binormal lighting dot product (B L) at the output of floating point adder 321. 

A table illustrating an example schedule of computational events for 
accomplishing emboss-style bump-mapping occurring per pipeline data clocking 
cycle/stage within Transform Unit 300 using dot unit 301 and dot unit 302 is 
provided immediately below: 



Cycle # Vector Dot Unit #1 VF P Adder Vector Dot Unit #2 

1 LoadT 

2 T xe =M0*T 

3 T ye =Ml»T 

4 T ze =M2»T 

5 Load B 

6 B xe = M0*B 

7 B ye = M1»B 

8 B ze = M2»B Lx=Vex-Lpx 

9 Ly=Vey-Lpy 

10 Lz=Vez-Lpz 

1 1 Out T xe 

12 OutT ye 

13 OutT ze 

14 LdT ey e>L 

15 OutB xe Out T*L; Ld L 

16 Out B y e OutL 2 

17 Out B ze 

18 Ld B e ye 

19 Out B»L 



During relative cycles/stages numbered 1 through 8 ? the Tangent and 
Binormal vectors are loaded into dot unit 301 and the transforms to eye space are 
computed. During cycles 9 through 1 1, light direction vector components L x , L y , 
and L z are computed by floating point adder 304 using eye space vertex 
composition components and negative signed light position components. During 
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cycles 1 1-13, the computed Tangent vector eye space components are loaded into 
multiplexing/staging buffer 302. During Cycle 14. the computed light direction 
vector, L, and the computed Tangent eye space vector. Teye = (T xe? T ye? T ze ), are 
loading into the vector dot unit 303 for computing the T»L dot product. On cycle 
5 15, the computed light direction vector, L, is again loaded into the vector dot unit 
303 to compute the light direction vector squared product, L2. Finally, the 
binormal eye space vector, Beye = (B xe , B ye? B a ), is loaded on cycle 18 to compute 
the B*L dot product. The hardware described above is fully pipelined and can 
compute the required values in a minimal number of distinct operations. 

10 Example API Function Commands 

In the preferred embodiment, an enhanced graphics API function is used to 
initiate texture coordinate generation within transform unit 300. In addition to 
conventional texture coordinate generation wherein current vertex attribute 
information is used to generate a texture coordinate, the preferred graphics API 
15 supports an enhanced texture generation function that is capable of calling and 
using other texture coordinate generation functions. An example enhanced API 
texture coordinate generation function may be defined as follows: 



GXSetTexCoordGen 



Arguments: 
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GXTexCoord 



DstjCoord; II name of generated texture coordinates 
Func; If coordinate generation function type 
Src j?aram; II Source parameters for coord generation 
Matldx; II Texture Matrix Index. 



GxtexGenType 
GXTexGenSrc 



u32 
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The above example API function defines general texture coordinate 
generation in addition to supporting other texture coordinate generation functions. 
The Matlclx is set as the default texture matrix index by which the generated 
texture coordinates are to be transformed. In the present example embodiment, to 
5 implement emboss-style bump-mapping, the above API function is used with Func 
set to GX__TG_BUMP*, where * is a number from 0-7 indicative of one of up to 
eight possible different lights (light source positions) which may be selected for 
embossing. 

The following is an example C/C++ language implementation of the above 
10 general texture coordinate generation function: 

void GXSetTexCoordGeni 

GXTexCoordID dst_coor& 
GXTexGenType func, 
GXTexGenSrc src_param, 
15 u32 mtx); 

With "func" set to GX_TG_BUMP0-7, system 50 performs emboss-style 
bump mapping by perturbing input texture coordinates based on per- vertex 
specified binormals and light direction information. The original and offset texture 
coordinates are used to look up texels from a height-field bump map texture stored 
20 in memory 112. TEV unit 600 can be used to subtract these values in hardware in 
one pass to find the bump height, which value can be added to the final color of the 
pixel to provide emboss-style bump mapping. GX_BUMP0 indicates that light 0 
will be used, GX_BUMP1 indicates that light 1 will be used, etc., in the bump map 
calculation. 

25 The dst_coord for bump maps should be numbered sequentially, i.e. base 

texture coordinate = n, and bump offset texture coordinate = n+1 . Bump map 
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texture coordinates should be generated after coordinates generated from 
transforms (GX„TGJVITX2x4 and GXJTG_MTX3x4) and before coordinates 
generated from lighting channels (GX_TG_SRTG). An example follows: 

// source for a bump mapped coordinate, transformed by a matrix 

5 GXSetTexCoordGen(GX__TEXCOORD0, GX_TG„MTX2x4, 

GX_TG_TEX0, GX_TEXMTX3); 

// perturbed coordinate, offset from TEXCOORDO above, light 0. Matrix 
(mtx) is not used for the perturbed coordinates (therefore use an identity matrix). 

GXSetTexCoordGen(GX_TEXCOORD 1 , GX_TG_BUMP0, 
10 GX_TGJTEXCOORD0, GXJDENTITY). 



Other Example Compatible Implementations 

Certain of the above-described system components 50 could be 
implemented as other than the home video game console configuration described 

15 above. For example, one could run graphics application or other software written 
for system 50 on a platform with a different configuration that emulates system 50 
or is otherwise compatible with it. If the other platform can successfully emulate, 
simulate and/or provide some or all of the hardware and software resources of 
system 50, then the other platform will be able to successfully execute the 

20 software. 

As one example, an emulator may provide a hardware and/or software 
configuration (platform) that is different from the hardware and/or software 
configuration (platform) of system 50. The emulator system might include 
software and/or hardware components that emulate or simulate some or all of 
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hardware and/or software components of the system for which the application 
software was written. For example, the emulator system could comprise a general 
purpose digital computer such as a personal computer, which executes a software 
emulator program that simulates the hardware and/or firmware of system 50. 

5 Some general purpose digital computers (e.g., IBM or Macintosh personal 

computers and compatibles) are now equipped with 3D graphics cards that provide 
3D graphics pipelines compliant with DirectX or other standard 3D graphics 
command APIs. They may also be equipped with stereophonic sound cards that 
provide high quality stereophonic sound based on a standard set of sound 

10 commands. Such multimedia-hardware-equipped personal computers running 
emulator software may have sufficient performance to approximate the graphics 
and sound performance of system 50. Emulator software controls the hardware 
resources on the personal computer platform to simulate the processing, 3D 
graphics, sound, peripheral and other capabilities of the home video game console 

15 platform for which the game programmer wrote the game software. 

Figure 9 illustrates an example overall emulation process using a host 
platform 1201, an emulator component 1303, and a game software executable 
binary image provided on a storage medium 62. Host 1201 may be a general or 
special purpose digital computing device such as, for example, a personal 

20 computer, a video game console, or any other platform with sufficient computing 
power. Emulator 1303 may be software and/or hardware that runs on host 
platform 1201, and provides a real-time conversion of commands, data and other 
information from storage medium 62 into a form that can be processed by host 
1201. For example, emulator 1303 fetches "source" binary-image program 

25 instructions intended for execution by system 50 from storage medium 62 and 
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converts these program instructions to a target format that can be executed or 
otherwise processed by host 1201. 

As one example, in the case where the software is written for execution on a 
platform using an IBM PowerPC or other specific processor and the host 1201 is a 
5 personal computer using a different (e.g., Intel) processor, emulator 1303 fetches 
one or a sequence of binary-image program instructions from storage medium 62 
and converts these program instructions to one or more equivalent Intel binary- 
image program instructions. The emulator 1303 also fetches and/or generates 
graphics commands and audio commands intended for processing by the graphics 

10 and audio processor 1 14, and converts these commands into a format or formats 
that can be processed by hardware and/or software graphics and audio processing 
resources available on host 1201. As one example, emulator 1303 may convert 
these commands into commands that can be processed by specific graphics and/or 
or sound hardware of the host 1201 (e.g., using standard DirectX, OpenGL and/or 

15 sound APIs). 

An emulator 1303 used to provide some or all of the features of the video 
game system described above may also be provided with a graphic user interface 
(GUI) that simplifies or automates the selection of various options and screen 
modes for games run using the emulator. In one example, such an emulator 1303 
20 may further include enhanced functionality as compared with the host platform for 
which the software was originally intended. In the case where particular graphics 
support hardware within an emulator does not include the embossed bump 
mapping functions shown in Figures 7 and 8. the emulator designer has a choice of 
either: 

25 • translating emboss-style bump mapping commands into other graphics 

API commands the graphics support hardware understands, or 
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• implementing the bump mapping functions in software with a potential 
corresponding decrease in performance depending upon the speed of the 
processor, or 

• "stubbing" (i.e., ignoring) the bump mapping commands to provide a 
5 rendered image that does not include embossed effects. 

While the Figure 6 flowchart can be implemented entirely in software, 
entirely in hardware or by a combination of hardware and software, the preferred 
embodiment performs most of these calculations in hardware to obtain increased 
speed performance and other advantages. Nevertheless, in other implementations 
10 (e.g., where a very fast processor is available), the computations and steps of 

Figure 6 may be implemented in software to provide similar or identical imaging 
results. 

Figure 10 illustrates an emulation host system 1201 suitable for use with 
emulator 1303. System 1201 includes a processing unit 1203 and a system 

15 memory 1205. A system bus 1207 couples various system components including 
system memory 1205 to processing unit 1203. System bus 1207 may be any of 
several types of bus structures including a memory bus or memory controller, a 
peripheral bus. and a local bus using any of a variety of bus architectures. System 
memory 1207 includes read only memory (ROM) 1252 and random access 

20 memory (RAM) 1254. A basic input/output system (BIOS) 1256, containing the 
basic routines that help to transfer information between elements within personal 
computer system 1201, such as during start-up, is stored in the ROM 1252. 
System 1201 further includes various drives and associated computer-readable 
media. A hard disk drive 1209 reads from and writes to a (typically fixed) 

25 magnetic hard disk 121 L An additional (possible optional) magnetic disk drive 
1213 reads from and writes to a removable "floppy" or other magnetic disk 1215. 
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An optical disk drive 1217 reads from and, in some configurations, writes to a 
removable optical disk 1219 such as a CD ROM or other optical media. Hard disk 
drive 1209 and optical disk drive 1217 are connected to system bus 1207 by a hard 
disk drive interface 1221 and an optical drive interface 1225, respectively. The 

5 drives and their associated computer-readable media provide nonvolatile storage of 
computer-readable instructions, data structures, program modules, game programs 
and other data for personal computer system 1201 . In other configurations, other 
types of computer-readable media that can store data that is accessible by a 
computer (e.g., magnetic cassettes, flash memory cards, digital video disks, 

10 Bernoulli cartridges, random access memories (RAMs), read only memories 
(ROMs) and the like) may also be used. 

A number of program modules including emulator 1303 may be stored on 
the hard disk 1211, removable magnetic disk 1215, optical disk 1219 and/or the 
ROM 1252 and/or the RAM 1254 of system memory 1205. Such program 

15 modules may include an operating system providing graphics and sound APIs, one 
or more application programs, other program modules, program data and game 
data. A user may enter commands and information into personal computer system 
1201 through input devices such as a keyboard 1227, pointing device 1229, 
microphones, joysticks, game controllers, satellite dishes, scanners, or the like. 

20 These and other input devices can be connected to processing unit 1203 through a 
serial port interface 1231 that is coupled to system bus 1207, but may be connected 
by other interfaces, such as a parallel port, game port Fire wire bus or a universal 
serial bus (USB). A monitor 1233 or other type of display device is also connected 
to system bus 1207 via an interface, such as a video adapter 1235. 

25 System 1201 may also include a modem 1 154 or other network interface 

means for establishing communications over a network 1 152 such as the Internet. 
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Modem 1 154, which may be internal or external, is connected to system bus 123 
via serial port interface 1231. A network interface 1 156 may also be provided for 
allowing system 1201 to communicate with a remote computing device 1150 (e.g., 
another system 1201) via a local area network 1 158 (or such communication may 
5 be via wide area network 1 152 or other communications path such as dial-up or 
other communications means). System 1201 will typically include other peripheral 
output devices, such as printers and other standard peripheral devices. 

In one example, video adapter 1235 may include a 3D graphics pipeline chip 
set providing fast 3D graphics rendering in response to 3D graphics commands 

10 issued based on a standard 3D graphics application programmer interface such as 
Microsoft's DirectX 7.0 or other version. A set of stereo loudspeakers 1237 is also 
connected to system bus 1207 via a sound generating interface such as a 
conventional "sound card" providing hardware and embedded software support for 
generating high quality stereophonic sound based on sound commands provided by 

15 bus 1207. These hardware capabilities allow system 1201 to provide sufficient 
graphics and sound speed performance to play software stored in storage medium 
62. 

All documents referenced above are hereby incorporated by reference. 

While the invention has been described in connection with what is presently 
20 considered to be the most practical and preferred embodiment, it is to be 

understood that the invention is not to be limited to the disclosed embodiment, but 
on the contrary, is intended to cover various modifications and equivalent 
arrangements included within the scope of the appended claims. 
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We Claim: 

1 LA method of performing embossed-style bump mapping comprising: 

2 providing a vertex attribute descriptor that describes Tangent and Binormal 

3 vector data for each of plural vertices of a polygon; 

4 computing a light direction vector; 

5 computing texture coordinate displacements for each of said vertices in 

6 response to said light direction vector and said Tangent and Binormal vector data; 

7 generating texture coordinate values in response to said computed texture 

8 coordinate displacements; and 

9 texture mapping said polygon based on said texture coordinate values to 

10 provide an embossing effect; 

1 1 wherein the texture coordinate displacements computing step does not use 

12 Normal vector data to compute said texture coordinate displacements. 

1 2. A method as in claim 1 where said per- vertex data includes at least a 

2 Normal, a Tangent, and a Binormal vector. 

1 3. A method as in claim 1 further including transforming the Tangent and 

2 Binomial vectors to eye-space. 

1 4. A method as in claim 1 wherein the texture coordinate displacement 

2 computing step includes performing vector dot-product computations between the 

3 computed light direction vector and the Tangent and Binormal vectors. 

1 5. A method as in claim 1 wherein the light direction vector computing step 

2 comprises computing at least a normalized light-to- vertex vector. 

1 6. A method of performing embossed-style bump mapping comprising: 
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2 decoding a generalized texture coordinate generation function that specifies 

3 generating embossed-style bump-mapping texture displacements; 

4 perturbing input texture coordinates based on binormals and light direction 

5 information in response to the decoding step; 

6 using the input texture coordinates and the perturbed texture coordinates to 

7 look up texels in a height-field bump map; 

8 computing bump height information based on the texel values; and 

9 combining the bump height information with pixel color value information 

10 to provide embossed-style bump mapping; 

1 1 wherein the combining step is performed in texture processing hardware. 

1 7. The method of claim 6 wherein the perturbing step is performed in 

2 hardware. 

1 8. The method of claim 6 wherein the perturbing step comprises computing 

2 texture coordinate displacement values based a Tangent vector and a Binormal 

3 vector and a dot-product of each vector with a light direction vector. 

1 9. The method of claim 6 wherein the computing step comprises subtracting 

2 texel data acquired using perturbed texture coordinates from texel data acquired 

3 using input texture coordinates. 

1 10. The method of claim 6 wherein the decoding step comprises decoding a 

2 generalized vertex attribute description function that specifies a Tangent vector and 

3 a Binormal vector. 

1 11. The method of claim 10 wherein the Tangent and Binormal vectors are 

2 specified by reference to separate memory indexes. 
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1 12. In a graphics system including a processor and a separate graphics 

2 processing pipeline having transformation and lighting circuitry, the pipeline 

3 performing emboss-style bump-mapping based on texels in response to texture 

4 coordinate displacements computed from Tangents and Binomials, an 

5 improvement comprising: 

6 texture coordinate displacement computation circuitry included within the 

7 graphics pipeline vertex transformation and lighting circuitry. 

1 13. In a graphics system including a processor and a separate graphics 



2 processing pipeline, the pipeline performing emboss-style bump-mapping based on 

3 texels in response to textures coordinate displacements computed from Tangents 

4 and Binomials, an improvement comprising performing the texture coordinate 

5 displacement computation within the pipeline. 

1 14. A graphics processing system having vertex transformation and lighting 

2 processing hardware and an enhanced API vertex attribute description command 

3 function for specifying at least Tangent and Binormal object-space surface vectors, 

4 wherein the geometry processing and lighting hardware transforms the object- 

5 space Tangent and Binormal vectors to eye-space, computes an eye-space light 

6 direction vector based a light position and a vertex position, and performs vector 

7 dot-product computations between the computed light direction vector and the 

8 transformed Tangent and Binormal surface vectors to generate texture coordinate 

9 displacements for use in creating an embossed texture effect, and wherein the 

10 Tangent and Binormal vectors are scaled by scaling a model view matrix and 

1 1 applying the scaled model view matrix to the Tangent and Binormal vectors. 

1 15. In a graphics processing system that renders and displays images at least 

2 in part in response to polygon vertex attribute data and texture color data stored in 
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3 an associated memory, a vertex transformation and lighting processing portion 

4 embodied in hardware, comprising: 

5 a vector processing unit comprising at least two distinct dot-product 

6 computation circuits for computing vector dot-products; and 

7 a bump-mapping unit for computing at least a normalized light-to-vertex 

8 vector and a set of texture displacement values for use in creating an embossed 

9 texture effect. 

1 16. In a graphics processing system that renders and displays images at least 

2 in part in response to polygon vertex attribute data and texture color data stored in 

3 an associated memory, vertex transformation and lighting processing hardware 

4 comprising: 

5 vector processing circuitry comprising a plurality of dot-product 

6 computation units; and 

7 bump-mapping circuitry comprising an inverse square-root computation 

8 circuit for computing a reciprocal of a square-root of an input value and at least 

9 one floating point multiplier and one floating point adder. 

1 17. The geometry vertex and lighting processing hardware of claim 16 

2 wherein the bump-mapping unit further comprises a FIFO input buffer for storing 

3 incoming texture coordinate values. 

1 18. The vertex transformation and lighting processing hardware of claim 16 

2 wherein the bump-mapping unit further comprises a floating point adder for 

3 computing a light-to- vertex vector. 

1 19. The vertex transformation and lighting processing hardware of claim 16 

2 wherein the dot-product computation units comprise at least one floating point 

3 multiplier and one floating point adder. 
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1 20. In a graphics processing system that renders and displays images at least 

2 in part in response to polygon vertex attribute data and texture color data stored in 

3 an associated memory, the graphics system including vertex transformation and 

4 lighting processing hardware for generating texture coordinate displacement values 

5 for implementing at least embossed-style bump-mapped texture effects, the vertex 

6 transformation and lighting processing hardware comprising: 



7 a first vector dot-product computation unit for transforming vector data to 

8 eye-space; 

9 a second vector dot-product computation unit for computing lighting 

1 0 direction vector dot-products ; 

11 an inverse square-root computation unit for computing a reciprocal of a 

12 square-root of an input value; 

13 at least one floating point multiplier unit; and 

14 at least one floating point adder unit: 

15 wherein in response to an API bump mapping function instruction the 



16 vertex transformation and lighting processing hardware transforms vector data per- 

17 vertex into eye-space, computes texture coordinate displacement values based on 

18 lighting direction vector dot-products, and adds the displacement values to texture 

19 coordinates for use in producing emboss-style bump-mapped texture effects. 

1 21. The graphics processing system of claim 20 further including a graphics 

2 API vertex attribute function which specifies at least Normal, Tangent and 

3 Binormal vectors per vertex, or specifies an index to at least each of these vectors 

4 stored in memory. 
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1 22. The graphics processing system of claim 20 wherein the API bump 

2 mapping function is defined to specify one of at least eight different textures for 

3 producing embossing effects. 

1 23. The graphics processing system of claim 20 wherein the Normal, 

2 Tangent and Binormal vectors each comprise three 32-bit vector elements. 

1 24. In a graphics processing system that renders and displays images at least 



2 in part in response to polygon vertex attribute data and texture color data stored in 

3 an associated memory, the graphics system including a geometry transform unit 

4 comprising hardware for at least computing a coordinate-space transformation and 

5 a vector dot-product, a method of implementing embossed- style bump-mapped 

6 texture effects in graphics rendering system, comprising the steps of: 



7 storing a texture data image in memory, the texture data image comprising 

8 color values parameterized by at least two coordinate values representing two 

9 orthogonal axes for mapping the image: 

10 supplying light position information, texture coordinate information, vertex 



1 1 position information and object-space Normal. Binormal and Tangent vector data 

12 per polygon vertex to the geometry transform unit, wherein for each vertex said 

13 Binormal and Tangent vector data map respectively, in an object-space coordinate 

14 system, to each orthogonal axis of the bump-map image; 



15 transforming the object-space Normal. Binormal and Tangent vector data to 

16 an eye-space coordinate system; 

17 computing a light direction vector from light position and vertex position 

18 information; 
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19 computing a texture coordinate displacement based on a vector dot-product 

20 between the light direction vector and each of the Binormal and Tangent eye -space 

2 1 vector components; 

22 adding the texture coordinate displacement to eye-space texture coordinates 

23 to obtain a set of displaced texture coordinates; 

24 using the set of displaced texture coordinates to retrieve texture color data 

25 from the stored texture data image; and 

26 performing texture subtraction in one pass. 

1 25. A method of performing embossed-style bump mapping comprising: 

2 providing a description of Tangent and Binormal vectors for each of plural 

3 vertices of a polygon; 

4 providing a light direction vector; 

5 computing texture coordinate displacements for each of said vertices in 

6 response to said light director vector and said Tangent and Binormal vector; 

7 generating texture coordinates in response to said computed texture 

8 coordinated displacements; and 

9 texture mapping said polygon based on said texture coordinates, including 

10 providing a texture combining operation that performs textile subtraction in a 

11 single pass. 

1 26. The method of claim 25 wherein said texture combining is performed in 

2 texture hardware. 

1 27. The method of claim 25 further including scaling the Tangent and 

2 Binomial vector data by scaling a model view matrix and applying the scaled 

3 model view matrix to the vector data. 
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1 28. The method of claim 25 wherein the texture coordinate displacement 

2 computing does not use a Normal vector. 

1 29. The method of claim 25 wherein the texture coordinate displacement 

2 computing computes the following in parallel: 

3 a first vector dot-product between the light direction vector and the Tangent 

4 vector, 

5 a second vector dot-product between the light direction vector and the 

6 Binormal vector, and 

7 the square of the light direction vector. 

1 30. The method as in claim 25 wherein the texture coordinate displacement 



2 computing step is performed using two distinct dot-product computation units, the 

3 first dot-product computation unit computing eye-space transformation of the 

4 Tangent and Binormal vectors, the second dot-product computation unit computing 

5 at least vector dot-products between the light direction vector and each of the 

6 Tangent and Binormal vectors. 



1 3 1 . In a graphics chip including a logic array, a pipelined arrangement 

2 implemented within the logic array that performs embossed-style bump mapping 

3 based on Tangent and Binormal vectors for each of plural vertices of a polygon 

4 and a light direction vector, said arrangement including: 

5 a dot-product computation unit and associated logic circuitry adapted to 

6 receive a scaling factor, the dot-product computation unit and associated logic 

7 circuitry scaling a model view matrix in response to the scaling factor and applying 

8 the scaled model view matrix to the Tangent and Binormal vectors to provide 

9 texture coordinate displacements for each of said vertices; and 
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10 texture mapping and combining circuitry that generates embossing effects in 

1 1 response to said texture coordinate displacements. 

1 32. Apparatus as in ciaim 31 wherein said texture mapping and combining 

2 circuitry performs texture subtraction in one pass. 

1 33. Apparatus as in claim 3 1 wherein the dot-product computation unit does 

2 not use the Normal input vector to compute displacements. 

1 34. Apparatus as in claim 3 1 further including a further dot-product 

2 computation unit that parallelly computes dot products between the Binormal and 

3 Tangent vectors and a light direction vector. 
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Abstract Of The Disclosure 

A graphics system including a custom graphics and audio processor 
produces exciting 2D and 3D graphics and surround sound. The system includes a 
graphics and audio processor including a 3D graphics pipeline and an audio digital 
signal processor. Emboss style effects are created using fully pipelined hardware 
including two distinct dot-product computation units that perform a scaled model 
view matrix multiply without requiring the Normal input vector and which also 
compute dot-products between the Binomial and Tangent vectors and a light 
direction vector in parallel. The resulting texture coordinate displacements are 
provided to texture mapping hardware that performs a texture mapping operation 
providing texture combining in one pass. The disclosed pipelined arrangement 
efficiently provides interesting embossed style image effects such as raised and 
lowered patterns on surfaces. 
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